WE CLAIM: 



1. An isolated consensus DNA sequence of the BRCA1 coding sequence as set forth in 
SEQ. ID. NO.: 1. 

2. A consensus protein sequence of the BRCA1 protein as set forth in 
SEQ. ID. NO.: 2. 

3. An isolated coding sequence of the BRCA1 gene as set forth in 
SEQ. ID. NO.: 3. 

43 A protein sequence of the BRCA1 protein as set forth in 
SIQ. ID. NO.: 4 . 

SP An isolated coding sequence of the BRCA1 gene as set forth in 
SIQ. ID. NO.: 5. 

|y A protein sequence of the BRCA1 protein as set forth in SEQ. ID. NO.: 6. 

ff A BRCA1 gene with a BRCA1 coding sequence not associated with breast or 
ovarian cancer which comprises an alternative pair of codons, AGC and AGT t which 
occur at position 2201 at frequencies of about 35-45%, and from about 55-65%, 
respectively. 

8. A BRCA1 gene according to Claim 7 wherein AGC occurs at a frequency of about 
40%. 

9. A set of at least two alternative codon pairs which occur at polymorphic positions 
in a BRCA1 gene with a BRCA1 coding sequence not associated with breast or ovarian 
cancer, wherein codon pairs are selected from the group consisting of: 

• AGC and AGT at position 2201; 

• TTG and CTG at position 2430; 
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• CCG and CTG at position 2731; 

• GAA and GGA at position 3232; 

• AAA and AGA at position 3667; 
TCT and TCC at position 4427; and 
AGT and GGT at position 4956. 

10. A set of at least two alternative codon pairs according to claim 9, wherein the 
codon pairs occur in the following frequencies, respectively, in a population of 
individuals free of disease: 

• at position 2201, AGC and AGT occur at frequencies from about 35-45%, and from 
about 55-65%, respectively; 

• at position 2430, TTG and CTG occur at frequencies from about 35-45%, and from 
about 55-65%, respectively; 

•^i at position 2731, CCG and CTG occur at frequencies from about 25-35%, and 

from about 65-75%, respectively; 
V= at position 3232, GAA and GGA occur at frequencies from about 35-45%, and 

from about 55-65%, respectively; 
•f Li at position 3667, AAA and AGA occur at frequencies from about 35-45%, and from 
!V about 55-65%, respectively; 

•ry at position 4427, TCT and TCC occur at frequencies from about 45-55%, and from 
f " about 45-55%, respectively; and 

•C3 at position 4956, AGT and GGT occur at frequencies from about 35-45%, and from 
" about 55-65%, respectively. 

11 A set according to Claim 10 which is at least three codon pairs. 

12 A set according to Claim 10 which is at least four codon pairs. 

13. A set according to Claim 10 which is at least five codon pairs. 

14. A set according to Claim 10 which is at least six codon pairs. 
15 A set according to Claim 10 which is at least seven codon pairs. 
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16. A method of identifying individuals having a BRCA1 gene with a BRCA1 coding 
sequence not associated with disease, comprising: 

(a) amplifying a DNA fragment of an individual's BRCA1 coding sequence using 
an oligonucleotide primer which specifically hybridizes to sequences within 
the gene; 

(b) sequencing said amplified DNA fragment by dideoxy sequencing; 

(c) repeating steps (a) and (b) until said individual's BRCA1 coding sequence is 
completely sequenced; 

(d) comparing the sequence of said amplified DNA fragment to a BRCAl< omi ) 
DNA sequence, SEQ. ID. NOl, SEQ. ID. N03, or SEQ. ID. NOS; 

(e) determining the presence or absence of each of the following polymorphic 
variation in said individual's BRCA1 coding sequence: 

• AGC and AGT at position 220 1 . 
c ; • TTG and CTG at position 2430, 

H • CCG and CTG at position 273 1, 

%l i • GAA and GGA at position 3232, 

Crl • AAA and AGA at position 3667, 

r ] • TCT and TCC at position 4427, and 

f • AGT and GGT at position 4956; 

m (f) determining any sequence differences between said individual's BRCA1 
h coding sequences and SEQ. ID. NOl, SEQ. ID. N03, or SEQ. ID. N05 

q wherein the presence of said polymorphic variations and the absence of a 

° variation outside of positions 2201, 2430, 2731, 3232, 3667, 4427, and 

4956, is correlated with an absence of increased genetic susceptibility to 
breast or ovarian cancer resulting from a BRCA1 mutation in the BRCA1 
coding sequence. 

17. A method of claim 16 wherein, codon variations occur at the following 
frequencies, respectively, in a Caucasian population of individuals free of disease: 

• at position 2201, AGC and AGT occur at frequencies from about 35-45%, and from 
about 55-65%, respectively; 

• at position 2430, TTG and CTG occur at frequencies from about 35-45%, and from 
about 55-65%, respectively; 
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• at position 2731, CCG and CTG occur at frequencies from about 25-35%, and 
from about 65-75%, respectively; 

• at position 3232, GAA and GGA occur at frequencies from about 35-45%, and 
from about 55-65%, respectively; 

• at position 3667, AAA and AGA occur at frequencies from about 35-45%, and from 
about 55-65%, respectively; 

• at position 4427, TCT and TCC occur at frequencies from about 45-55%, and from 
about 45-55%, respectively; and 

• at position 4956, AGT and GGT occur at frequencies from about 35-45%, and from 
about 55-65%, respectively. 

18. A method according to claim 16 wherein said oligonucleotide primer is labeled 
with a radiolabel, a fluorescent label a bioluminescent label, a chemiluminescent label, 
OK an enzyme label. 

lj|. A method of detecting a increased genetic susceptibility to breast and ovarian 
cpjncer in an individual resulting from the presence of a mutation in the BRCA1 coding 
^quence, comprising: 

r 5 (a) amplifying a DNA fragment of an individual's BRCA1 coding sequence using 
ry an oligonucleotide primer which specifically hybridizes to sequences within 

F~ the gene; 

C3 (b) sequencing said amplified DNA fragment by dideoxy sequencing; 
~~~ (c) repeating steps (a) and (b) until said individual's BRCA1 coding sequence is 
completely sequenced; 

(d) comparing the sequence of said amplified DNA fragment to a BRCAl(° mi) 
DNA sequence, SEQ. ID. NOl, SEQ. ID. N03, or SEQ. ID. NOS; 

(e) determining any sequence differences between said individual's BRCA1 
coding sequences and SEQ. ID. NOl, SEQ. ID. N03. or SEQ. ID. NOS; to 
determine the presence or absence of base changes in said individual's 
BRCA1 coding sequence wherein a base change which is not any one of the 
following: 

AGC and AGT at position 220 1 , 

• TTG and CTG at position 2430, 

• CCG and CTG at position 2731, 
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• GAA and GGA at position 3232, 

• AAA and AGA at position 3667, 

• TCT and TCC at position 4427, and 

• AGT and GGT at position 4956 is correlated with the potential of 
increased genetic susceptibility to breast or ovarian cancer resulting 
from a BRCA1 mutation in the BRCA1 coding sequence. 

20. A method of claim 19 wherein, codon variations occur at the following 
frequencies, respectively, in a population free of disease: 

• at position 2201, AGC and AGT occur at frequencies from about 40%, and from 
about 55-65%, respectively; 

• at position 2430, TTG and CTG occur at frequencies from about 35-45%, and from 
„ about 55-65%, respectively; 

•Ig at position 2731, CCG and CTG occur at frequencies from about 25-35%, and 
g 71 from about 65-75%, respectively; 

at position 3232, GAA and GGA occur at frequencies from about 35-45%, and 
?1 from about 55-65%, respectively; 

•f y at position 3667, AAA and AGA occur at frequencies from about 35-45%, and from 
l jl about 55-65%, respectively; 

•rU at position 4427, TCT and TCC occur at frequencies from about 45-55%, and from 
[j about 45-55%, respectively; and 

at position 4956, AGT and GGT occur at frequencies from about 35-45%, and from 
about 55-65%, respectively. 

21. A method according to claim 19 wherein said oligonucleotide primer is labeled 
with a radiolabel, a fluorescent label a bioluminescent label, a chemiluminescent label, 
or an enzyme label. 

22. A set of codon pairs, which occur at polymorphic positions in a BRCA1 gene with 
a BRCA1 coding sequence according to Claim 1, wherein said set of codon pairs is: 

AGC and AGT at position 220 1 ; 
TTG and CTG at position 2430; 

• CCG and CTG at position 273 1 ; 

• GAA and GGA at position 3232; 
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AAA and AGA at position 3667; 
TCT and TCC at position 4427; and 
AGT and GGT at position 4956. 



23. A set of at least two alternative codon pairs according to claim 22 wherein set of 
at least two alternative codon pairs occur at the following frequencies: 

• at position 2201, AGC and AGT occur at frequencies of about 40%, and from 
about 55-65%, respectively; 

• at position 2430, TTG and CTG occur at frequencies from about 35-45%, and from 
about 55-65%, respectively; 

• at position 2731, CCG and CTG occur at frequencies from about 25-35%, and 
from about 65-75%, respectively; 

«H at position 3232, GAA and GGA occur at frequencies from about 35-45%, and 
Q from about 55-65%, respectively; 

n j at position 3667, AAA and AGA occur at frequencies from about 35-45%, and from 
z= about 55-65%, respectively; 

• j at position 4427, TCT and TCC occur at frequencies from about 45-55%, and from 
1 y about 45-55%, respectively; and 

%^ at position 4956, AGT and GGT occur at frequencies from about 35-45%, and from 
j = about 55-65%, respectively. 

A BRCA1 coding sequence according to claim 1 wherein the codon pairs occur at 
the following frequencies: 

• at position 2201, AGC and AGT occur at frequencies of about 40%, and from 
about 55-65%, respectively; 

• at position 2430, TTG and CTG occur at frequencies from about 35-45%, and from 
about 55-65%, respectively; 

• at position 2731, CCG and CTG occur at frequencies from about 25-35%, and 
from about 65-75%, respectively; 

• at position 3232, GAA and GGA occur at frequencies from about 35-45%, and 
from about 55-65%, respectively; 

• at position 3667, AAA and AGA occur at frequencies from about 35-45%, and from 
about 55-65%, respectively; 

• at position 4427, TCT and TCC occur at frequencies from about 45-55%, and from 
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about 45-55%, respectively; and 
• at position 4956, AGT and GGT occur at frequencies from about 35-45%, and from 
about 55-65%, respectively. 

25. A method of determining the consensus genomic sequence or consensus coding 
sequence for a target gene, comprising: 

a) screening a number of individuals in a population for a family history which indicates 
inheritance of normal alleles for a target gene; 

b) isolating at least one allele of the target gene from individuals found to have a family 
history which indicates inheritance of normal alleles for a target gene; 

c) sequencing each allele; 

d) comparing the nucleic acid sequence of the genomic sequence or of the coding sequence 
"5 of each allele of the target gene to determine similarities and differences in the nucleic 
^ acid sequence; and 

e^S determining which allele of the target gene occurs with the greatest frequency. 

261 A method of performing gene therapy, comprising: 

f~ a) transfecting cancer cell in vivo with an effective amount of a vector transformed 
[I with a BRCA1 coding sequences of SEQ. ID. NO.: 1, SEQ. ID. NO.: 3, or SEQ. ID. 

NO.: 5; 

C3 b) allowing the cells to take up the vector, and 
c) measuring a reduction in tumor growth. 

27. A method of performing protein therapy, comprising: 

a) injecting into a patient, an effective amount of BRCA1 tumor growth inhibiting 
protein of SEQ. ID. NO.: 2, SEQ. ID. NO.: 4, or SEQ. ID. NO.: 6; 

b) allowing the cells to take up the protein, and 

c) measuring a reduction in tumor growth. 
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